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Abstract 

We propose a combination of probabilistic reasoning from 
conditional constraints with approaches to default reasoning 
from conditional knowledge bases. In detail, we generalize 
the notions of Pearl's entailment in system Z, Lehmann's lex- 
icographic entailment, and Geffner's conditional entailment 
to conditional constraints. We give some examples that show 
that the new notions of z-, lexicographic, and conditional en- 
tailment have similar properties like their classical counter- 
parts. Moreover, we show that the new notions of z-, lexi- 
cographic, and conditional entailment are proper generaliza- 
tions of both their classical counterparts and the classical no- 
tion of logical entailment for conditional constraints. 

Introduction 

In this paper, we elaborate a combination of probabilistic 
reasoning from conditional constraints with approaches to 
default reasoning from conditional knowledge bases. As a 
main result, this combination provides new notions of en- 
tailment for conditional constraints, which respect the ideas 
of classical default reasoning from conditional knowledge 
bases, and which are generally much stronger than the clas- 
sical notion of logical entailment based on conditioning. 
Moreover, the results of this paper can also be applied for 
handling inconsistencies in probabilistic knowledge bases. 

Informally, the ideas behind this paper can be described as 
follows. Assume that we have the following knowledge at 
hand: "all penguins are birds" (Gl), "between 90 and 95% 
of all birds fly" (G2), and "at most 5% of all penguins fly" 
(G3). Moreover, assume a first scenario in which "Tweety is 
a bird" (El) and second one in which "Tweety is a penguin" 
(E2). What do we conclude about Tweety's ability to fly? 

A closer look at this example shows that the statements 
G1-G3 describe statistical knowledge (or objective knowl- 
edge), while El and E2 express degrees of belief (or subjec- 
tive knowledge). One way of handling such combinations of 
statistical knowledge and degrees of belief is reference class 
reasoning, which goes back to Reichenbach (1949) and was 
further refined by Kyburg (1974; 1983) and Pollock (1990). 

Another related field is default reasoning from conditional 
knowledge bases, where we have generic statements of the 
form "all penguins are birds", "generally, all birds fly", and 
"generally, no penguin flies" in addition to some concrete 



evidence as El and E2. The literature contains several differ- 
ent approaches to default reasoning and extensive work on 
the desired properties. The core of these properties are the 
rationality postulates proposed by Kraus etal. (1990). These 
rationality postulates constitute a sound and complete ax- 
iom system for several classical model-theoretic entailment 
relations under uncertainty measures on worlds. In detail, 
they characterize classical model-theoretic entailment under 
preferential structures (Shoham 1987; Kraus et al. 1990), 
infinitesimal probabilities (Adams 1975; Pearl 1989), possi- 
bility measures (Dubois & Prade 1991), and world rankings 
(Spohn 1988; Goldszmidt & Pearl 1992). . They also char- 
acterize an entailment relation based on conditional objects 
(Dubois & Prade 1994). A survey of all these relationships 
is given in (Benferhat et al. 1997). Recently, Friedman and 
Halpern (2000) showed that many approaches describe to 
the same notion of inference, since they are all expressible 
as plausibility measures. 

Mainly to solve problems with irrelevant information, the 
notion of rational closure as a more adventurous notion 
of entailment has been introduced by Lehmann (Lehmann 
1989; Lehmann & Magidor 1992). This notion of entailment 
is equivalent to entailment in system Z by Pearl (1990), to 
the least specific possibility entailment by Benferhat et al. 
(1992), and to a conditional (modal) logic-based entailment 
by Lamarre (1992). Finally, mainly in order to solve prob- 
lems with property inheritance from classes to exceptional 
subclasses, the maximum entropy approach to default entail- 
ment was proposed by Goldszmidt et al. (1993); the notion 
of lexicographic entailment was introduced by Lehmann 
(1995) and Benferhat et al. (1993); the notion of conditional 
entailment was proposed by Geffner (Geffner 1992; Geffner 
& Pearl 1992); and an infinitesimal belief function approach 
was suggested by Benferhat et al. (1995). 

Coming back to our introductory example, we realize 
that G1-G3 and E1-E2 represent interval restrictions for 
conditional probabilities, also called conditional constraints 
(Lukasiewicz 1999b). The literature contains extensive 
work on reasoning about conditional constraints (Dubois & 
Prade 1988; Dubois et al. 1990; 1993; Amarger et al. 1991; 
Jaumard et al. 1991; Thone et al. 1992; Frisch & Haddawy 
1994; Heinsohn 1994; Luo et al. 1996; Lukasiewicz 1999a; 
1999b) and their generalizations, for example, to probabilis- 
tic logic programs (Lukasiewicz 1998). 



Now, the main idea of this paper is to use techniques for 
default reasoning from conditional knowledge bases in order 
to perform probabilistic reasoning from statistical knowl- 
edge and degrees of beliefs. More precisely, we extend 
the notions of entailment in system Z, Lehmann's lexico- 
graphic entailment, and Geffner's conditional entailment to 
the framework of conditional constraints. 

Informally, in our introductory example, the statements 
G2 and G3 are interpreted as "generally, a bird flies with 
a probability between 0.9 and 0.95" (G2*) and "generally, 
a penguin flies with a probability of at most 0.05" (G3*), 
respectively. In the first scenario, we then simply use the 
whole probabilistic knowledge {G1,G2*,G3*,E1} to con- 
clude under classical logical entailment that "Tweety flies 
with a probability between 0.9 and 0.95". In the second 
scenario, it turns out that the whole probabilistic knowl- 
edge {G1,G2*,G3*,E2} is unsatisfiable. More precisely, 
{G1,G2*,G3*} is inconsistent in the context of a pen- 
guin. In fact, the main problem is that G2* should not be 
applied anymore to penguins. That is, we can easily re- 
solve the inconsistency by removing G2*, and then conclude 
from {G1,G3*,E2} under classical logical entailment that 
"Tweety flies with a probability of at most 0.05". 

Hence, the results of this paper can also be used for 
handling inconsistencies in probabilistic knowledge bases. 
More precisely, the new notions of nonmonotonic entail- 
ment coincide with the classical notion of logical entailment 
as far as satisfiable sets of conditional constraints are con- 
cerned. Furthermore, they allow desirable conclusions from 
certain kinds of unsatisfiable sets of conditional constraints. 

We remark that this inconsistency handling is guided by 
the principles of default reasoning from conditional knowl- 
edge bases. It is thus based on a natural preference relation 
on conditional constraints, and not on the assumption that 
all conditional constraints are equally weighted (as, for ex- 
ample, in the work by Jaumard et al. (1991)). 

The work closest in spirit to this paper is perhaps the one 
by Bacchus et al. (1996), which suggests to use the ran- 
dom worlds method (Grove et al. 1994) to induce degrees 
of beliefs from quite rich statistical knowledge bases. How- 
ever, differently from (Bacchus et al. 1996), we do not make 
use of a strong principle such as the random worlds method 
(which is closely related to probabilistic reasoning under 
maximum entropy). Moreover, we restrict our considera- 
tions to the propositional setting. 

The main contributions of this paper are as follows: 

• We illustrate that the classical notion of logical entailment 
for conditional constraints is not very well-suited for de- 
fault reasoning with conditional constraints. 

• We introduce the notions of z-entailment, lexicographic 
entailment, and conditional entailment for conditional 
constraints, which are a combination of the classical no- 
tions of entailment in system Z (Pearl 1990), Lehmann's 
lexicographic entailment (Lehmann 1995), and Geffner's 
conditional entailment (Geffner 1992; Geffner & Pearl 
1992), respectively, with the classical notion of logical 
entailment for conditional constraints. 

• We give some examples that analyze the nonmonotonic 



properties of the new notions of entailment for default rea- 
soning with conditional constraints. It turns out that the 
new notions of z-entailment, lexicographic entailment, 
and conditional entailment have similar properties like 
their classical counterparts. 

• We show that the new notions of z-entailment, lexico- 
graphic entailment, and conditional entailment for con- 
ditional constraints properly extend the classical notions 
of entailment in system Z, lexicographic entailment, and 
conditional entailment, respectively. 

• We show that the new notions of z-entailment, lexico- 
graphic entailment, and conditional entailment for condi- 
tional constraints properly extend the classical notion of 
logical entailment for conditional constraints. 

Note that all proofs are given in (Lukasiewicz 2000). 

Preliminaries 

We now introduce some necessary technical background. 

We assume a finite nonempty set of basic propositions 
(or atoms) <£>. We use _L and T to denote the propositional 
constants false and true, respectively. The set of classical 
formulas is the closure of $ U {_L, T} under the Boolean 
operations -i and A. A strict conditional constraint is an ex- 
pression u] with real numbers l,ue [0, 1] and clas- 
sical formulas ip and <j). A defeasible conditional constraint 
(or default) is an expression (^H^)^, u] with real numbers 
l,u£ [0, 1] and classical formulas ip and <p. A conditional 
constraint is a strict or defeasible conditional constraint. 
The set of strict probabilistic formulas (resp., probabilis- 
tic formulas) is the closure of the set of all strict condi- 
tional constraints (resp., conditional constraints) under the 
Boolean operations -i and A. We use (FvG), (F=>G), 
and (F G) to abbreviate ->(->F A ->G), ->(F A -.G), and 
(-<(->F A G)) A (^(F A -'G)), respectively, and adopt the 
usual conventions to eliminate parentheses. 

A probabilistic default theory is a pair T = (P, D), where 
P is a finite set of strict conditional constraints and D is 
a finite set of defeasible conditional constraints. A proba- 
bilistic knowledge base KB is a strict probabilistic formula. 
Informally, default theories represent strict and defeasible 
generic knowledge, while probabilistic knowledge bases ex- 
press some concrete evidence. 

A possible world is a truth assignment I : $ — ► {true, 
false}, which is extended to classical formulas as usual. We 
use 2$ to denote the set of all possible worlds for <£>. A pos- 
sible world I satisfies a classical formula <p, or I is a model 
of <j>, denoted I \= <j), iff I(4>) = true. 

A probabilistic interpretation Pr is a probability func- 
tion on 2$ (that is, a mapping Pr : 2$ — > [0, 1] such that all 
Pr(I) with / e 2$ sum up to 1). The probability of a classi- 
cal formula <ft in the probabilistic interpretation Pr, denoted 
Pr(4>), is defined as follows: 

Pr(cP) = £ Pr(I). 

For classical formulas <j> and ip with Pr{(f>) > 0, we use 
Pr(tp\(j)) to abbreviate Pr(tp A (f>) / Pr{<p). The truth of 



probabilistic formulas F in a probabilistic interpretation Pr, 
denoted Pr \= F, is inductively defined as follows: 

• Pr\={i>\<l>)[l,u] iff Pr(<t>) = or Pr(i>\(f>)€ [I, u]. 

• Pr \= (ip\\(p)[l,u\ iff Pr(<p) = 0or Pr(ip\<j>)€[l,u}. 

• Pr^^F iff not Pr \= F. 

• Pr \= (F A G) iff Pr\=F and Pr \= G. 

We remark that there is no difference between strict and 
defeasible conditional constraints as far as the notion of truth 
in probabilistic interpretations is concerned. 

A probabilistic interpretation Pr satisfies a probabilistic 
formula F, or Pr is a model of F, iff Pr \= F. Pr satisfies 
a set of probabilistic formulas T, or Pr is a model of T, 
denoted Pr \= T, iff Pr is a model of all F . We say T 
is satisfiable iff a model of T exists. 

We next define the notion of logical entailment as fol- 
lows. A strict probabilistic formula F is a logical conse- 
quence of a set of probabilistic formulas T, denoted T \= F, 
iff each model of T is also a model of F. A strict condi- 
tional constraint (^>|<^)[Z, u] is a tight logical consequence 
of T, denoted T \= tight (V'l^ , )['> u ]> ^ ' ( res P-> u ) is tne m " 
fimum (resp., supremum) of Pr(tp\4>) subject to all models 
Pr of T with Pr{4>) > (note that we canonically define 
Z = landu = 0,when^"|=(^|T)[0,0]). 

We remark that every notion of entailment for conditional 
constraints is associated with a notion of consequence and a 
notion of tight consequence. Informally, the notion of con- 
sequence describes entailed intervals, while the notion of 
tight consequence characterizes the tightest entailed inter- 
val. That is, if (ib\(b)[l, u] is a tight consequence of T, then 
[/', u'} D [l,u] for all consequences (tp\ </>)[/', u'] of T. 

Motivating Examples 

What should a probabilistic knowledge base entail under a 
probabilistic default theory? To get a rough idea on the re- 
ply to this question, we now introduce two natural notions of 
entailment and analyze their properties. It will turn out that 
neither of these two notions is fully adequate for probabilis- 
tic default reasoning with conditional constraints. 

In the sequel, let T = (P, D) be a probabilistic de- 
fault theory. We first define the notion of O-entailment, 
which applies to probabilistic knowledge bases of the 
form KB = (e|T)[l, 1]. In detail, a strict conditional con- 
straint (tp\(j))[l, u] is a ^-consequence of KB, denoted 
KB \^°(^\<j))[l } u}, iff PUD \= A e)[/,u]. It is a tight 
^-consequence of KB, denoted KB \y tight (V#)[Z, u], iff 
PUD \=tight (^10 A e)[Z, w]. Informally, we use the con- 
crete evidence in KB to fix our "point of interest" and the 
generic knowledge in T to draw the requested conclusion. 
That is, we perform classical conditioning. 

We next define the notion of 1 -entailment, which applies 
to all probabilistic knowledge bases KB. A strict prob- 
abilistic formula F is a 1-consequence of KB, denoted 
KB H~'F,iff PUDU{KB} \= F. A strict conditional con- 
straint (ib\4>) [I, u] is a tight 1-consequence of KB, denoted 
KB\^ tight (^)[l,u}, iff PuDU{KB}\= tight W<t>)[l,u]. 



Informally, we draw our conclusion from the union of the 
concrete evidence in KB and the generic knowledge in T. 

We now analyze the properties of these two notions of 
entailment. Our first example concentrates on the aspects of 
ignoring irrelevant information and property inheritance. 

Example 1 The knowledge "all penguins are birds" and 
"at least 95% of all birds have legs" can be expressed by 
the following probabilistic default theory T\ = (Pi, Di): 

Pi = {(bird\penguin)[l,l}}, 

Di = {(legs\\bird)[M,l]}. 

Now, T\ should entail that "generally, birds have legs 
with a probability of at least 0.95" (that is, e.g., if we know 
that Tweety is a bird, and we do not have any other knowl- 
edge, then we should conclude that the probability of Tweety 
having legs is at least 0.95). Indeed, this conclusion is drawn 
under both 0- and 1-entailment (see item (1) in Table 1). 

Moreover, T\ should entail that "generally, yellow birds 
have legs with a probability of at least 0.95" (as the property 
"yellow" is not mentioned at all in T\ and thus irrelevant), 
and that "generally, penguins have legs with a probability of 
at least 0.95" (as the set of all penguins is a nonexceptional 
subclass of the set of all birds, and thus penguins should 
inherit all properties of birds). However, while 1-entailment 
still allows the desired conclusions, 0-entailment just yields 
the interval [0, 1] (see items (2)-(3) in Table 1). □ 

We next concentrate on the principle of specificity and the 
problem of inheritance blocking. 

Example 2 Let us consider the following probabilistic de- 
fault theory T 2 = {P2,D 2 ): 

Pi = {(bird\penguin)[l,l}}, 

D 2 = {(legs\\bird)[M,l], (fly\\bird)[.9, .95], 

(fly \\penguin)[0, .05]} . 

This default theory should entail that "generally, penguins 
fly with a probability of at most 0.05" (as properties of 
more specific classes should override inherited properties of 
less specific classes). Indeed, 0-entailment yields the de- 
sired conclusion, while 1-entailment reports an unsatisfia- 
bility (see item (7) in Table 1). 

Moreover, T 2 should entail that "generally, penguins have 
legs with a probability of at least 0.95", since penguins are 
exceptional birds w.r.t. to the ability of being able to fly, but 
not w.r.t. the property of having legs. However, 0-entailment 
provides only the interval [0,1], and 1-entailment reports 
even an unsatisfiability (see item (5) in Table 1). □ 

The following example deals with the drowning problem 
(Benferhatef al. 1993). 

Example 3 Let us consider the following probabilistic de- 
fault theory T 3 = (P 3 , D 3 ) : 

P3 = {{bird\penguin)[l,l}}, 

D 3 = {(fly\\bird)[.9,.95], (fly \\penguin)[0, .05], 

(easyJosee \\ yellow)[.95, 1]} . 

This default theory should entail that "generally, yellow 
penguins are easy to see", as the set of all yellow penguins 



Table 1: Examples of 0- and 1 -entailed tight intervals. 
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is a nonexceptional subclass of the set of all yellow objects. 
But, 0-entailment gives only the interval [0, 1], and 1-entail- 
ment reports an unsatisfiability (see item (8) in Table 1). □ 

The next example is taken from (Bacchus et al. 1996). 

Exampie 4 Let us consider the following probabilistic de- 
fault theory T A = (P 4 , D A ): 

P A = {(bird | magpie) [1, 1]}, 

Di = {(chirp || bird)[. 7, .8], (chirp \\ magpie)[Q, .99]} . 

This default theory should entail "generally, the proba- 
bility that magpies chirp is between 0.7 and 0.8", since we 
know more about birds w.r.t. the property of being able to 
chirp than about magpies. Indeed, both 0- and 1-entailment 
yield the desired conclusion (see item (9) in Table 1). □ 

The following example concerns ambiguity preservation 
(Benferhat et al. 1995). 

Exampie 5 Let us consider the following probabilistic de- 
fault theory T 5 = (P 5 , D 5 ): 

P5 = {(bird\penguin)[l, 1]}, 

D5 = {(fly\\metaLwings)[.95,l], (fly \\ Wre/)[.95, 1], 

(fly \\penguin)[0, .05]} . 

Assume now that Oscar is a penguin with metal wings. As 
Oscar is a penguin, we should conclude that the probability 
that Oscar flies is at most 0.05. However, as Oscar has also 
metal wings, we should conclude that the probability that 
Oscar flies is at least 0.95. As argued in the literature on 
default reasoning (Benferhat et al. 1995), such ambiguities 
should be preserved. Indeed, 0-entailment yields the desired 
interval [0, 1], while 1-entailment reports an unsatisfiability 
(see item (10) in Table 1). □ 

What about handling purely probabilistic evidence! 

Example 6 Let us consider again the probabilistic default 
theory T 2 of Example 2. Assume a first scenario in which 



our belief is "the probability that Tweety is a bird is at least 
0.9" and "the probability that Tweety is a penguin is at least 
0.1" and a second scenario in which our belief is "the prob- 
ability that Tweety is a bird is at least 0.9" and "the proba- 
bility that Tweety is a penguin is at least 0.9". What do we 
conclude about Tweety's ability to fly in these scenarios? 

The notion of 0-entailment is undefined for such purely 
probabilistic evidence, whereas the notion of 1-entailment 
yields the probability interval [.86, .91] in the first scenario, 
and reports an unsatisfiability in the second scenario (see 
items (1 1)-(12) in Table 1). □ 

Summarizing the results, 0-entailment is too weak, while 
1-entailment is too strong. In detail, 0-entailment often 
yields the trivial interval [0, 1] and is even undefined for 
purely probabilistic evidence, while 1-entailment often re- 
ports unsatisfiabilities (in fact, in the most interesting sce- 
narios, as 1-entailment is actually monotonic). 

Roughly speaking, our ideal notion of entailment for 
probabilistic knowledge bases under probabilistic default 
theories should lie somewhere between 0- and 1-entailment. 
One idea to obtain such a notion could be to strengthen 0- 
entailment by adding some inheritance mechanism. Another 
idea is to weaken 1-entailment by handling unsatisfiabilities. 
In the rest of this paper, we will focus on the second idea. 

Probabilistic Default Reasoning 

In this section, we extend the classical notions of entailment 
in system Z (Pearl 1990), Lehmann's lexicographic entail- 
ment (1995), and Geffner's conditional entailment (Geffner 
1992; Geffner & Pearl 1992) to conditional constraints. 

The main idea behind these extensions is to use the fol- 
lowing two interpretations of defaults. As far as default 
rankings and priority orderings are concerned, we interpret a 
default (tp\\(j))[l, u] as "generally, if <j> is true, then the prob- 
ability of ij) is between I and u". Whereas, as far as notions 



of entailment are concerned, we interpret (ip\\ (f>)[l, u] as "the 
conditional probability of given <p is between I and m". 

Preliminaries 

A probabilistic interpretation Pr verifies a default 
iff Pr{4>) = \ and Pr \= (i)\4>)[l, u]. It falsifies a default 
(xl)\\(j))[l,u} iff Pr((p) = 1 and Pr ^ (ip\(j>)[l,u}. A set of de- 
faults D tolerates a default d under a set of strict conditional 
constraints P iff P U D has a model that verifies d. A set 
of defaults D is under P in conflict with d iff no model of 
PUD verifies d. 

A default ranking a on D maps each d € -D to a nonnega- 
tive integer. It is admissible with T = (P, D) iff each set of 
defaults D 1 CD that is under P in conflict with some default 
d€D contains a default d' such that o~(d') < a(d). A prob- 
abilistic default theory T = (P, D) is a-consistent iff there 
exists a default ranking on D that is admissible with T. It is 
a -inconsistent iff no such default ranking exists. 

A probability ranking k maps each probabilistic interpre- 
tation on 2$ to a member of {0, 1, . . . } U {oo} such that 
n(Pr) = for at least one interpretation Pr. It is extended 
to all strict probabilistic formulas F as follows. If F is sat- 
isfiable, then k(F) = min {n(Pr) \ Pr \= F}; otherwise, 
k(F) = oo. We say k is admissible with F iff n(^F) = oo. 
It is admissible with a default (?/>||0)[Z, u] iff 

k((4>\T)[1,1\) < oo and 

k((0|T)[1, 1] A (V#)[Z, u]) < k((0|T)[1, 1] A «]) . 

Roughly speaking, the intuition behind this definition is to 
interpret (ip \ \ 4>) [I , u] as "generally, if (f> is true, then the prob- 
ability of tp is between I and u". A probability ranking k is 
admissible with a probabilistic default theory T = (P, D) iff 
k is admissible with all F g P and all deD. 

System Z 

We now extend the notion of entailment in system Z (Pearl 
1990; Goldszmidt & Pearl 1996) to conditional constraints. 

In the sequel, let T = (P, D) be a er-consistent probabilis- 
tic default theory. The notion of z-entailment is linked to an 
ordered partition of D, a default ranking z, and a probability 
ranking k z . 

We first define the z-partition of D. Let (Do, . . . , Dk) be 
the unique ordered partition of D such that, for i = 0, . . . , k, 
each Di is the set of all defaults in D — [J{Dj < j < i} 
that are tolerated under P by D — [J{Dj | < j < i} (note 
that we define D - {j{D 3 - \ < j < i} = D for i = 0). We 
call this (Do, . . . , Dk) the z-partition of D. 

Example 7 The z-partition for the probabilistic default the- 
ory Ti = (P2, D2) of Example 2 is given as follows: 

{{{legs || bird)[M, 1], (fly \\ bird)[.9, .95]}, 
{(fly\\penguin)[0,M}}). □ 

We now define the default ranking z. For j = 0, . . . , k, 
each d E Dj is assigned the value j under z. The probabil- 
ity ranking k z on all probabilistic interpretations Pr is then 



defined as follows: 

!oo if Pr ^ P 

if Pr^ PUD 

1 + max z(d) otherwise. 
deD: PrY^d 

The following result shows that, in fact, z is a default 
ranking that is admissible with T, and k z is a probability 
ranking that is admissible with T. 

Lemma 8 a) z is a default ranking admissible with T. 
bj n z is a probability ranking admissible with T. 

We next define a preference relation on probabilistic in- 
terpretations. For probabilistic interpretations Pr and Pr', 
we say Pr is z-preferable to Pr 1 iff K z (Pr) < n z (Pr'). 
A model Pr of a set of probabilistic formulas T is a z- 
minimal model of T iff no model of T is z-preferable to Pr. 

We are now ready to define the notion of z-entailment 
as follows. A strict probabilistic formula F is a z-con- 
sequence of KB, denoted KB \\^ Z F, iff each z-minimal 
model of P U {KB} satisfies F. A strict conditional con- 
straint (ip\(j>)[l , u] is a tight z-consequence of KB, denoted 
KB W^tight (V'l'/OP; u ]> iff I ( res P-> u ) is the infimum (resp., 
supremum) of Pr(ip\(f>) subject to all z-minimal models Pr 
of P U {KB} with Pr(4>) > 0. 

Coming back to Examples 1-6, it turns out that the non- 
monotonic properties of z-entailment differ from the ones of 
0- and 1 -entailment (see Table 2). 

In detail, in the given examples, z-entailment ignores ir- 
relevant information, shows property inheritance to glob- 
ally nonexceptional subclasses, and respects the principle 
of specificity. Moreover, it may also handle purely proba- 
bilistic evidence. However, properties are still not inherited 
to more specific classes that are exceptional with respect to 
some other properties. Moreover, z-entailment still has the 
drowning problem and does not preserve ambiguities. 

The following examples illustrate how z-entailed tight in- 
tervals are determined. 

Example 9 Given T2 of Example 2, we get: 

(penguin \ T)[l, 1] \r tight { le 8 s I T )[°> X ] 

Here, the interval "[0, 1]" comes from the tight logical con- 
sequence P 2 U {(fly \\penguin)[0, .05], (penguin | T)[l, 1]} 
hfefe(&S'|T)[0,l].D 

Example 10 Given T 5 of Example 5, we get: 

(penguin A metaLwings | T)[l, 1] \r tight (fly I "OI^ -05] . 

Here, the interval "[0, .05]" comes from the tight logi- 
cal consequence P 5 U {(fly \\penguin)[0,. 05], (penguin A 
metaLwings\T)[l, 1]} \= tight (fty\T)[0, .05]. □ 

Lexicographic Entailment 

We now extend Lehmann's lexicographic entailment 
(Lehmann 1995) to conditional constraints. 

In the sequel, let T = (P, D) be a cr-consistent probabilis- 
tic default theory. We now use the z-partition (D , . . . , D k ) 
of D to define a lexicographic preference relation on proba- 
bilistic interpretations. 



Table 2: Examples of z-, lexicographically, and conditionally entailed tight intervals. 





T 


KB 


(V#) 


1 1 2 


lex 

\r tight 


1 1 ce 

\rtight 


(1) 


Ti 


(bird\ T)[l,l] 


(fc*j|T) 


[.95,1] 


[.95,1] 


[.95,1] 


(2) 


Ti 


(bird A yellow \T)[1,1] 


(Zeg*|T) 


[.95,1] 


[.95,1] 


[.95,1] 


(3) 


Ti 


(penguin T)[l, 1] 


(fc*j|T) 


[.95,1] 


[.95,1] 


[.95,1] 


(4) 


T 2 


(bird\ T)[l,l] 


(fc*j|T) 


[.95,1] 


[.95,1] 


[.95,1] 


(5) 


T 2 


(penguin T)[l, 1] 


(Zeg*|T) 


[0,1] 


[.95,1] 


[.95,1] 


(6) 


T 2 


(bird T)[l, 1] 


(fly|T) 


[.9, .95] 


[.9, .95] 


[.9, .95] 


(7) 


T 2 


(penguin T)[l, 1] 




[0, .05] 


[0, .05] 


[0, .05] 


(8) 


T 3 


(penguin A yellow T)[l, 1] 


(easy Jo see T) 


[0,1] 


[.95,1] 


[.95,1] 


(9) 


T A 


(magpie |T) [1,1] 


(c/h/j? T) 


[-7, .8] 


[-7, .8] 


[-7, .8] 


(10) 


T 5 


(penguin A metal-wings T)[l, 1] 


(fly|T) 


[0, .05] 


[0, .05] 


[0,1] 


(11) 


T 2 


(W/rf| T)[.9, 1] A (penguin | T)[.l, 1] 


(/fy|T) 


[.86, .91] 


[.86, .91] 


[.86, .91] 


(12) 


T 2 


T)[.9, 1] A (penguin | T)[.9, 1] 


(fly|T) 


[0,-15] 


[0..15] 


[0..15] 



For probabilistic interpretations Pr and Pr' , we say Pr 
is lexicographically preferable to Pr' iff there exists some 
ie{0,...,k} such that |{dG D { \ Pr \=d}\ > \{d£ A | 
Pr' h d}| and \{de D 3 -\ Pr\= d}\ = \{d G D 3 - | Pr' |= d}\ 
for all i < j <k. A model Pr of a set of probabilistic for- 
mulas T is a lexicographically minimal model of T iff no 
model of T is lexicographically preferable to Pr. 

We now define the notion of lexicographic entailment as 
follows. A strict probabilistic formula F is a lexicographic 
consequence of KB, denoted KB ||~ lex F, iff each lexico- 
graphically minimal model of PU{KB} satisfies F. A strict 
conditional constraint (ip\(f))[l,u] is a tight lexicographic 

consequence of KB, denoted KB htig ht {ip\<p)[l,u], iff I 
(resp., u) is the infimum (resp., supremum) of Pr(ip\4>) sub- 
ject to all lexicographically minimal models Pr of P\J{KB} 
with Pr(4>) >0. 

Coming back to Examples 1-6, it turns out that lexico- 
graphic entailment has nicer nonmonotonic features than 
^-entailment (see Table 2). 

In detail, in the given examples, lexicographic entailment 
ignores irrelevant information, shows property inheritance 
to nonexceptional subclasses, and respects the principle of 
specificity. Moreover, it does not block property inheritance, 
it does not have the drowning problem, and it may also han- 
dle purely probabilistic evidence. However, lexicographic 
entailment still does not preserve ambiguities. 

The following examples illustrate how lexicographically 
entailed tight intervals are determined. 

Example 11 Given T 2 of Example 2, we get: 

(penguin | T)[l, 1] (legs | T)[.95, 1] . 

Here, the interval "[.95,1]" comes from the tight logical con- 
sequence P 2 U {(legs || bird)[.95, 1], (fly \\penguin)[0, .05], 
(penguin \ T)[l, 1]} \= tight (legs \ T)[.95, 1]. □ 



Example 12 Given T5 of Example 5, we get: 

(penguin A metaLwings | T)[l, 1] |]~^^ (fly \ T)[0, .05] . 

Here, the interval "[0, .05]" comes from the tight logi- 
cal consequence P 5 U {(fly \\penguin)[0,. 05], (penguin A 
metaLwings\T)[l, 1]} \= tight (fly\T)[0, .05]. □ 

Conditional Entailment 

We next extend Geffner's conditional entailment (Geffner 
1992; Geffner & Pearl 1992) to conditional constraints. 

In the sequel, let T = (P, D) be a probabilistic default 
theory. 

We first define priority orderings on D as follows. A pri- 
ority ordering -< on D is an irreflexive and transitive binary 
relation on D. We say -< is admissible with T iff each set 
of defaults D' CD that is under P in conflict with some de- 
fault d G D contains a default d! such that d' -< d. We say T 
is ^-consistent iff there exists a priority ordering on D that 
is admissible with T. 

Example 13 Consider the probabilistic default theory 
T 2 = (P 2 ,D 2 ) of Example 2. A priority ordering -< on D 2 
that is admissible with T 2 is given by (fly || bird)[.9, .95] -< 
(fly \\penguin)[Q, .05]. □ 

The existence of an admissible default ranking implies the 
existence of an admissible priority ordering. 

Lemma 14 If T is a-consistent, then T is ^-consistent. 

We next define a preference ordering on probabilistic in- 
terpretations as follows. Let Pr and Pr' be two probabilis- 
tic interpretations and let -< be a priority ordering on D. We 
say that Pr is -^-preferable to Pr iff {d G D \ Pr y= d} ^ 
{deD\ Pr' y= d} and for each d G D such that Pr^d and 
Pr' \= d, there exists some default d' G D such that d -< d' , 
Pr \= d' , and Pr' y= d! . A model Pr of a set of probabilistic 
formulas T is a ^-minimal model of T iff no model of T 
is -(-preferable to Pr. A model Pr of a set of probabilistic 



formulas T is a conditionally minimal model of T iff Pr is 
a ^-minimal model of .F for some priority ordering -< ad- 
missible with T. 

We finally define the notion of conditional entailment. A 
strict probabilistic formula F is a conditional consequence 
of KB, denoted KB ||~ ce F, iff each conditionally minimal 
model of P U {KB} satisfies F. A strict conditional con- 
straint (tp\(j))[l, u] is a rigfe conditional consequence of 
denoted KB ^tight (ip\(j>)[l,u], iff Z (resp., u) is the infi- 
mum (resp., supremum) of Pr(tp\(f>) subject to all condition- 
ally minimal models Pr of P U {KB} with Pr(<j>) > 0. 

Coming back to Examples 1-6, we see that among all in- 
troduced notions of entailment, conditional entailment is the 
one with the nicest nonmonotonic properties (see Table 2). 

In detail, in the given examples, conditional entailment 
ignores irrelevant information, shows property inheritance 
to nonexceptional subclasses, and respects the principle of 
specificity. Moreover, it does not block property inheritance, 
and it does not have the drowning problem. Finally, condi- 
tional entailment preserves ambiguities and may also handle 
purely probabilistic evidence. 

The following examples illustrate how conditionally en- 
tailed tight intervals are determined. 

Example 15 Given T 2 of Example 2, we get: 

{penguin\T)[l,l]\^ t f ght (legs \T)[.95,1}. 

Here, the interval "[.95,1]" comes from the tight logical con- 
sequence Pi U {(legs || bird)[.95, 1], (fly \\penguin)[0, .05], 
(penguin | T)[l, 1]} ^ tight (legs | T)[.95, 1]. □ 

Example 16 Given T 5 of Example 5, we get: 

(penguin A metaLwings | T)[l, 1] H~^/^ (fly | T)[0, 1] . 

Here, the interval "[0,1]" is the convex hull of the inter- 
vals "[0, .05]" and "[.95, 1]", which come from the tight log- 
ical consequences P5 U {(fly \\penguin)[0, .05], (penguin A 
metaLwings |T)[1, l]}\=tight(fiy I T )[0, -05] and P 5 U{(/fy II 
bird)[.95, 1], (fly \\ metaLwings)[.95, 1], (penguin A metal, 
wings I T)[l, 1]} \= tight (fly | T)[.95, 1], respectively. □ 

Relationship to Classical Formalisms 

We now analyze the relationship to classical default reason- 
ing from conditional knowledge bases and to classical prob- 
abilistic reasoning with conditional constraints. 

A logical formula is a probabilistic formula that con- 
tains only conditional constraints of the kind (ip\<p) [1,1] or 
(■0||0)[1, 1]. A strict logical formula is a strict probabilistic 
formula that contains only strict conditional constraints of 
the form (tp\<p)[l, 1]. A logical default theory T is a proba- 
bilistic default theory that contains only logical formulas. A 
logical knowledge base KB is a strict logical formula. 

We use the operator 7 on logical formulas, sets of log- 
ical formulas, and logical default theories, which replaces 
each strict conditional constraint (tp\(j))[l, 1] (resp., defeasi- 
ble conditional constraint (^>||<^)[1, 1]) by the classical im- 
plication (p^>tjj (resp., classical default 0— >^>). Given a 
logical default theory T, we use |~ z (resp., |~ lex , |~ ce ) to 
denote the classical notion of z-, (resp., lexicographic, con- 
ditional) entailment with respect to "f(T). 



The following result shows that the introduced notions of 
z-, lexicographic, and conditional entailment are generaliza- 
tions of their classical counterparts. 

Theorem 17 Let T = (P, D) be a logical default theory 
and let KB be a logical knowledge base. Then, for every 
semantics s £ {z, lex, ce}: 

kbikoait)[m] ar 7 (tfs)hv. 

The next result shows that, when the union of generic 
and concrete probabilistic knowledge is satisfiable, the no- 
tions of z-, lexicographic, and conditional entailment coin- 
cide with the notion of 1 -entailment. 

Theorem 18 Let T = (P, D) be a probabilistic default the- 
ory and let KB be a probabilistic knowledge base such 
that PUDU {KB} is satisfiable. Then, for every seman- 
tics s £ {z, lex, ce}: 

1. KB \^ S F iff PUDU{KB} \= F. 

2. KB\^ ight (^)[lM iffPUDu{KB}^ tight (m[lM- 

Summary and Outlook 

We presented the notions of z-entailment, lexicographic en- 
tailment, and conditional entailment for conditional con- 
straints, which combine the classical notions of entail- 
ment in system Z, Lehmann's lexicographic entailment, and 
Geffner's conditional entailment with the classical notion of 
logical entailment for conditional constraints. We showed 
that the introduced notions for probabilistic default reason- 
ing with conditional constraints have similar properties like 
their classical counterparts. Moreover, they properly extend 
both their classical counterparts and the classical notion of 
logical entailment for conditional constraints. 

An interesting topic of future research is to extend other 
formalisms for classical default reasoning to the probabilis- 
tic framework of conditional constraints. 
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